Martin Schmid

Meta-Learning in Self-Play Regret Minimization

Apr 26, 2025

Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents

Apr 25, 2024

Learning not to Regret

Mar 02, 2023

Player of Games

Dec 06, 2021

Search in Imperfect Information Games

Nov 10, 2021

Solving Common-Payoff Games with Approximate Policy Iteration

Jan 11, 2021

The Advantage Regret-Matching Actor-Critic

Aug 27, 2020

Approximate exploitability: Learning a best response in large games

Apr 20, 2020

Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Jul 22, 2019

Rethinking Formal Models of Partially Observable Multiagent Decision Making

Jun 26, 2019